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5 

METHODS FOR IDENTIFYING COMPOUNDS THAT AFFECT RNA/RNA BINDING PROTEIN INTERACTIONS 

Field of the Invention 

10 This invention relates to the field of nucleic acid regulatory elements 

that affect post-transcriptional regulation of protein expression. More 
specifically, the invention features nucleic acid sequences, located in the 
mRNA untranslated regions of various therapeutically-relevant genes which 
specifically bind RNA binding proteins (RBPs) or have other roles in the 

15 regulation of protein expression from the linked RNA. These sequences are 
used in screening assays, for example, to identify compounds that affect the 
RNA/RBP binding pair interaction. Such compounds can be administered 
therapeutically to regulate protein expression. 

20 Background of the Invention 

The regulation of protein expression can occur at a number of levels: 
transcriptional, post-transcriptional, or post-translational. The modulation of 
protein expression can produce significant benefits in the treatment of disease. 
Accordingly, recent efforts to achieve such regulatory control have focused on 

25 the development of small molecules that regulate transcription factors as well 
as antisense molecules that inhibit protein expression. While the transcription 
factor approach holds promise, there are drawbacks. For example, a compound 
which modulates a transcription factor may lack specificity and require nuclear 
localization; both of these limitations are problematic for those in the field of 

30 drug development. For example, a drug that affects the binding of a targeted 
transcription factor and has beneficial results may also affect transcription of 
many other genes in a deleterious manner. In addition, there are difficulties in 
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designing a drug that both effectively interacts with the target and is 
transported into the nucleus. Targeting RNA with antisense oligonucleotides to 
inhibit protein expression also has drawbacks; the approach is restricted to 
strategies for decreasing protein expression and may be limited by relatively 

5 poor intracellular transport of oligonucleotides. 

Another possible approach to modifying post-transcriptional protein 
expression in eukaryotic cells involves targeting the specific interaction of 
proteins that bind RNA (RNA binding proteins or RBPs) with the RNA 
molecules. These RBPs appear to mediate the processing of pre-mRNAs, the 

10 transport of mRNA from the nucleus to the cytoplasm, mRNA stabilization, 
translational efficiency, and the sequestration of some mRNAs. The most 
common RBP motifs are the RNP motif, the Arg-rich motif, the RGG box, the 
KH motif, and the double-stranded RNA-binding motif (Burd and Dreyfuss, 
Science 265: 615-621, 1994). Some key factors in RNA/RBP interactions 

15 involve sequence and structure recognition, positional effects, and pre-binding 
to Z-DNA (Herbert et al., Proc. Natl. Acad. Sci. USA 92: 7550-7554, 1995; 
Maker 246: 664-666, 1989). 

Nucleic acid sequences and screening methods for detecting 
compounds which alter the RNA/RBP binding pair interaction would greatly 

20 speed the discovery of therapeutic drugs which alter protein expression. 

Summary of the Invention 

In general, the invention features nucleic acid sequences and their 
optimized subfragments that are located in the mRNA untranslated regions of 
25 various, therapeutically-relevant genes. These sequences specifically bind 
RNA binding proteins (RBPs) and/or are involved in mRNA functionality. 
Also featured are methods of truncating parent sequences to create optimized 
subfragments. These parent sequences and optimized sequences are used as 
RNA molecule targets to screen for compounds that affect the RNA/RBP 
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binding pair interaction or mRNA functionality. Compounds identified by the 
screening process can be administered therapeutically to regulate protein 
expression. 

In a first aspect, the invention features a nucleic acid sequence 
5 comprising one of the nucleic acid sequences of SEQ ID NOS: 1-20, or a 
subfragment nucleic acid sequence of SEQ ID NOS: 1-20, wherein an mRNA 
molecule comprising the sequence has RNA binding protein (RBP) binding 
activity or regulates the functionality of the mRNA. These nucleic acid 
sequences are derived from the following gene sequences: the human 

10 interleukin-6 receptor, Accession #'s X12830, M20556 (SEQ ID NO: 1); 
human cyclooxygenase-2, Accession # M90100 (SEQ ID NO: 2 and SEQ ID 
NO: 1 1); human ubiquitin, Accession # U49869, X04803 (SEQ ID NO: 3); 
human uracil DNA glycosylase-1 (Ung-1), Accession # X89398 (SEQ ID NO: 
4); human excision repair controlling gene, Accession # U16815 (SEQ ID NO: 

15 5); human cytochrome C oxidase 6b, Accession # D28426 (SEQ ID NO: 6); 
human cytochrome C oxidase 5b, Accession # U41284 (SEQ ID NO: 7); 
human Beta-2 adrenergic receptor, Accession # M15169, J02728, M16106 
(SEQ ID NO: 8); human breast cancer susceptibility-2 (BRCA2), Accession # 
U43746 (SEQ ID NO: 9); human interleukin-4, Accession # M23442 (SEQ ID 

20 NO: 10); human vascular cell adhesion molecule- 1, Accession # M73255 (SEQ 
ID NO: 12 and SEQ ID NO: 13); human interleukin-1 1, Accession # 81890 
(SEQ ID NO: 14); human macrophage CSF receptor (c-fms proto-oncogene), 
Accession # X03663 (SEQ ID NO: 15); human interleukin-9, Accession # 
M86593, M55519 (SEQ ID NO: 16); human leptin (murine obesity 

25 homologue), Accession # NM_000230 (SEQ ID NO: 17); rat neuropeptide Y5 
receptor, Accession # U66274 (SEQ ID NO: 18); rat orexin receptor- 1, 
Accession # AF041 1244 (SEQ ID NO: 19); and mouse mahogany protein, 
Accession # AF1 16897 (SEQ ID NO: 20). Preferably, the subfragment nucleic 
acid sequence is optimized. Preferably, the mRNA functionality of the parent 
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or subfragment sequence involves an alteration in pre-mRNA processing or in 
the stabilization, translational efficiency, localization, sequestration, editing, or 
splicing functions of the UTR-associated mRNA, more preferably, the 
alteration in pre-mRNA processing or in the stabilization, translational 

5 efficiency, localization, sequestration, editing, or splicing functions of the 
UTR-associated mRNA comprises a change of at least 20% above or below the 
corresponding value for a control mRNA that lacks the sequence. 

In a second aspect, the invention provides a method of identifying an 
optimized subfragment of any one of the parent nucleic acid sequences of SEQ 

10 ID NOS: 1-20, wherein the method involves isolating a subfragment nucleic 
acid sequence from the parent nucleic acid sequence, assaying RNA molecules 
comprising the subfragment for RBP binding activity or mRNA functionality, 
and identifying a subfragment nucleic sequence that maintains an RBP binding 
activity and/or mRNA functionality that is equivalent to the parent sequence. 

15 Preferably, the subfragment nucleic acid sequence is isolated by restriction 
enzyme digestion, and the subfragment is identified by deletion mapping. 
Preferably, the mRNA functionality of the sequences involves an alteration in 
pre-mRNA processing or in the stabilization, translational efficiency, 
localization, sequestration, editing, or splicing functions of the UTR-associated 

20 mRNA, more preferably, the alteration in pre-mRNA processing or in the 
stabilization, translational efficiency, localization, sequestration, editing, or 
splicing functions of the UTR-associated mRNA comprises a change of at least 
20% above or below the corresponding value for a control mRNA that lacks the 
sequence. 

25 A third and related aspect of the invention provides a nucleic acid 

sequence identified as an optimized subfragment of any one of SEQ ID NOS: 
1-20 by a method which involves isolating a subfragment nucleic acid sequence 
from the parent nucleic acid sequence, assaying RNA molecules comprising the 
subfragment for RBP binding activity or mRNA functionality, and identifying a 
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subfragment nucleic sequence that maintains an RBP binding activity and/or 
mRNA functionality that is equivalent to the parent sequence. Preferably, the 
mRNA functionality of the parent and subfragment sequences involves an 
alteration in pre-mRNA processing or in the stabilization, translational 
5 efficiency, localization, sequestration, editing, or splicing functions of the 
UTR-associated mRNA, more preferably, the alteration in pre-mRNA 
processing or in the stabilization, translational efficiency, localization, 
sequestration, editing, or splicing functions of the UTR-associated mRNA 
comprises a change of at least 20% above or below the corresponding value for 

10 a control mRNA that lacks the sequence. 

The fourth aspect features a method of identifying a candidate 
compound having an effect on an RNA/RBP binding pair interaction or mRNA 
functionality, wherein the method involves contacting an RNA molecule 
comprising at least one of the nucleic acid sequences of SEQ ID NOS: 1-20, or 

15 at least one optimized subfragment sequence of SEQ ID NOS: 1-20, with at 
least one RBP, and at least one test compound, and measuring the RNA/RBP 
binding pair interaction and/or mRNA functionality, wherein a candidate 
compound is identified as a test compound that affects the interaction and/or 
functionality. Preferably, the mRNA functionality of the parent or subfragment 

20 sequence involves an alteration in pre-mRNA processing or in the stabilization, 
translational efficiency, localization, sequestration, editing, or splicing 
functions of the mRNA, more preferably, the alteration in pre-mRNA 
processing or in the stabilization, translational efficiency, localization, 
sequestration, editing, or splicing functions of the UTR-associated mRNA 

25 comprises a change of at least 20% above or below the corresponding value for 
a control mRNA that lacks the sequence. 

The fifth aspect of the invention provides a method for identifying an 
RBP that interacts with an RNA molecule comprising the nucleic acid 
sequence, or an optimized subfragment sequence, of any one of SEQ ID NOS: 
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1-20, wherein the method involves contacting the RNA molecule with at least 
one RBP, and measuring RNA/RBP binding pair interactions. The detection of 
an RNA/RBP binding pair interaction identifies the RPB that interacts with the 
RNA molecule. 

5 - 

By "RNA/RBP binding pair interaction" is meant a physical 
association between an RNA molecule and an RBP, or an RBP complex made 
up of more than one protein, that is based on the specific characteristics of the 
interacting molecules, and is not inhibited by non-specific competitor 

10 molecules present at a concentration equivalent to the interacting molecules. 
The RNA and RBP molecules that form the RNA/RBP binding pair interaction 
can be separated from their counterpart, non-associated molecules by filter 
binding assay, electrophoretic mobility assay, homopolymer beads, or 
fluorescent anisotrophy assay. 

15 By "RBP binding activity" is meant the specific binding affinity of 

an RNA molecule comprising a certain nucleic acid sequence for the RBP with 
which it forms an RNA/RBP interacting pair. The RBP binding activity is 
specific for a particular RNA molecule if the RNA binds an RBP or RBP 
complex with at least 10-fold higher affinity than does a non-relevant RNA 

20 molecule with the same G-C content. The affinity measures can be determined 
by filter binding assay, electrophoretic mobility assay, homopolymer beads, or 
fluorescent anisotrophy assay. The specificity of the interaction between an 
RNA and an RNA binding protein can be determined, for example, by adding 
100 and 1,000 fold molar excess of unlabeled identical RNA (specific 

25 competitor) or the same excess of different RNA (non-specific competitor) to 
the binding reaction, and assaying for binding activity, for example, using 
assays described herein. A concentration-dependent inhibition of complex 
formation is observed with unlabeled identical RNA while no competition is 
seen when different RNA is added in excess. 
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By "mRNA functionality" is meant the ability of an mRNA molecule 
comprising a certain nucleic acid UTR sequence in association with a protein's 
coding region to alter the protein expression from the linked coding region by 
altering the pre-mRNA processing, or the stabilization, translational efficiency, 
5 localization, sequestration, or editing and splicing functions of the mRNA. The 
mRNA functionality can be assessed, for example, by measuring the half-life of 
the transcript (Saulnier-Blache et ah, Mol. Pharmacol. 50: 1432-1442, 1996; 
Yang et aL, J. Biol. Chem. 272: 15466-15473, 1997), polysomal distribution 
along the transcript (Izquierdo et al., Mol. Cell Biol. 17: 5255-5268, 1997; Luis 

10 et aL, J. Biol. Chem. 268: 1868-1875, 1993; Santaren et al., J. Biochem. 113: 
129-131, 1993), or the transcript's intracellular distribution (Yang et aL, 1997, 
supra). Any of the above measures of function for a UTR-associated transcript 
that differs by 20% or more above or below the value for the corresponding 
UTR-free transcript indicates that the UTR has mRNA functionality. An 

15 increase in the half-life (or a decrease in degradation rate) of the full length 
transcript indicates an increase in mRNA stability; an increase in transcript 
length indicates an increase in translational efficiency; and an increase in a 
transcript's relative distribution in the cytosol indicates in increase in transport 
out of the nucleus. 

20 By "equivalent to said parent sequence" is meant RBP binding 

activity or mRNA functionality differing by no more than 50%, preferably, by 
no more than 30%, and, more preferably, by no more than 10% from the 
corresponding activity or functionality in the parent sequence. 

By a "nucleic acid subfragment" is meant a shorter sequence derived 

25 from a longer parent sequence. The subfragment may be shortened by cleaving 
the 5' or the 3' end of the parent, or by internal deletion of a parent sequence, or 
by any combination of the above. 
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By an Optimized" nucleic acid sequence is meant a nucleic acid 
subfragment of a longer parent sequence, wherein the subfragment still 
maintains an RNA/RBP binding activity and/or mRNA functionality that is 
equivalent to the parent nucleic acid sequence. An optimized sequence can 
5 result from cleavage of the parent sequence at the 5' and/or 3' ends, from an 
internal deletion, or from a combination of the above. Alternatively, such a 
nucleic acid sequence may be synthesized. 

A nucleic acid molecule or nucleic acid segment referred to as 
having a specific nucleic acid sequence is intended to mean a nucleic acid 
10 molecule in any of its corresponding forms, for example, DNA, cDNA, RNA, 
or mRNA. 

Other features and advantages of the invention will be apparent from 
the following detailed description/and from the claims. 

15 Detailed Description 

RNA/RBP binding pair interactions are known to regulate protein 
expression through the modulation of RNA stabilization, translational 
efficiency, RNA localization, RNA transcription, and/or RNA editing and 
splicing processes. Given the importance of RNA/RBP binding pair 

20 interactions in the regulation of protein expression, they are attractive targets 
for the drug screening assays of the present invention. A compound identified 
in these screens as having an effect on a particular RNA/RBP binding pair 
interaction, or on the mRNA functionality, can modulate the expression of a 
gene product that is endogenously associated with the RNA sequence screened. 

25 Therefore, such compounds can be administered as a therapeutic for diseases 
associated with this protein. 

We have discovered certain nucleic acid sequences, derived from the 
mRNA untranslated regions (UTRs) of a variety of therapeutically-relevant 
genes, that specifically bind RNA binding proteins (RBPs) (SEQ ID NOS: 1- 
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20). We have also discovered methods of subdividing such RBP-binding 
nucleic acid sequences (parent sequences) to obtain shorter nucleic acid 
sequences that maintain an RNA/RBP binding pair interaction or mRNA 
functionality that is equivalent to the parent sequences. 
5 The mRNA functionality of the parent or optimized sequences 

involves the ability of an mRNA molecule, comprising a certain nucleic acid 
UTR sequence in association with a protein's coding region, to alter the protein 
expression from the linked coding region by altering the pre-mRNA processing, 
or the stabilization, translational efficiency, localization, sequestration, or 

10 editing and splicing functions of the mRNA. 

The smaller, or optimized sequences, can be used as substitutes for 
the parent sequences for the purposes further discussed below. If further 
confirmation of whether the optimized sequence is an adequate substitute for 
the parent sequence is desired, either the RBP binding activity or the mRNA 

15 functionality of the optimized sequence, or both properties, can be assessed to 
more predictably determine whether the optimized sequence will function in a 
manner that is relevant to the parent sequence. These optimized sequences 
have the advantage of having reduced nonspecific protein binding and of 
isolating one or more of the regulatory activities of the RNA molecules so that 

20 therapeutics with specificity for a given activity can be readily identified. 

The parent sequences of the present invention, or their optimized 
subfragments, can be translated into mRNA transcripts for the following: 1) to 
screen for compounds that affect the RBP binding activity of a particular 
RNA/RBP binding pair interaction, and/or the mRNA functionality; 2) to 

25 identify novel RNA/RBP binding pair interactions; and 3) to modify the protein 
expression of a heterologous gene. These methods are further discussed 
below. 
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The Parent Sequences 

We have identified twenty nucleic acid sequences that possess 
specific RBP binding ability. The twenty sequences are found in the mRNA 
UTR regions from eighteen different genes that were initially chosen for study 
5 because of their biological importance. The 5' and 3' UTRs were identified 
from database sequences having the following Accession #'s and encoding the 
following gene products: the human interleukin-6 receptor, Accession #'s 
X12830, M20556 (SEQ ID NO: 1); human cyclooxygenase-2, Accession # 
M90100 (SEQ ID NO: 2 and SEQ ID NO: 1 1); human ubiquitin, Accession # 

10 U49869, X04803 (SEQ ID NO: 3); human uracil DNA glycosylase-1 (Ung-1), 
Accession # X89398 (SEQ ID NO: 4); human excision repair controlling gene, 
Accession # U 16815 (SEQ ID NO: 5); human cytochrome C oxidase 6b, 
Accession # D28426 (SEQ ID NO: 6); human cytochrome C oxidase 5b, 
Accession # U41284 (SEQ ID NO: 7); human Beta-2 adrenergic receptor, 

15 Accession # M15169, J02728, M16106 (SEQ ID NO: 8); human breast cancer 
susceptibility-2 (BRCA2), Accession # U43746 (SEQ ID NO: 9); human 
interleukin-4, Accession # M23442 (SEQ ID NO: 10); human vascular cell 
adhesion molecule- 1, Accession # M73255 (SEQ ID NO: 12 and SEQ ID NO: 
13); human interleukin-1 1, Accession # 81890 (SEQ ID NO: 14); human 

20 macrophage CSF receptor (c-fms proto-oncogene), Accession # X03663 (SEQ 
ID NO: 15); human interleukin-9, Accession # M86593, M55519 (SEQ ID NO: 
16); human leptin (murine obesity homologue), Accession # NM_000230 (SEQ 
ID NO: 17); rat neuropeptide Y5 receptor, Accession # U66274 (SEQ ID NO: 
18); rat orexin receptor- 1, Accession # AF041 1244 (SEQ ID NO: 19); and 

25 mouse mahogany protein, Accession # AF1 16897 (SEQ ID NO: 20). The 
sequences of the present invention were amplified by PCR from cellular DNA, 
genomic DNA, or cDNA made from a relevant cell line by isolation of total 
RNA and reverse transcription. 
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The RBP binding activity of the UTR sequences was demonstrated 
as follows. The UTR sequence was placed under the control of a promoter for 
in vitro transcription, such as an SP6 or T7 promoter, and RNA was produced 
and radioactively labeled. The RNA was then mixed with a protein extract 

5 from an appropriate cell line that endogenously expresses the UTR-associated 
gene, for example, U937 and K562 cell lines were used for inflammatory 
genes, the 3T3-L1 cell line was used for leptin and mahogany genes, and rat 
brain extract was used for rat neuropeptide Y5 receptor and orexin receptor- 1 
genes. The protein extract was prepared by lysing cells with digitonin in buffer 

10 containing 25 mM Tris, pH 7.9, 0.5 mM EDTA, 0. 1 mM PMSF, 2 mM NaF, 2 
mM NaPPi, and 1-10 /xg/ml of the protease inhibitors aprotinin, leupeptin, and 
pepstatin. Extracts were centrifuged twice at 16,000 x g for 15 minutes at 4°C 
to remove nuclei, aliquoted, snap frozen on dry ice, and stored at -80 °C until 
use. 

15 The binding pair interactions between the UTR sequences of the 

invention and RBP's in the cellular extracts were facilitated by a binding 
solution containing 7.5 mM Bis-Tris Propane, pH 8.5, 50 mM K + , 1 mM Mg^ 
0.2 mM DTT, and 10% (v/v) glycerol. Poly r(G), tRNA, heparin molecules, 
and unrelated RNA molecules of similar length were used as non-specific 

20 competitors of the RNA/RBP binding pair interactions. The interactions were 
detected by filter binding assay or electrophoretic gel mobility shift assay. 

Techniques for in vitro transcription of RNA molecules and 
methods for cloning genes encoding known RNA molecules are described in 
the literature (Sambrook et al.) and commercial kits are available (mCap RNA 

25 capping kit, Stratagene). Other methods of preparing cell extracts and 

detecting RNA/RBP binding pair interactions are described in the literature 
(see, e.g., WO 98/04923) and are further discussed below with regard to the 
optimization of fragments and screening assays. 
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Optimization of Subfragments 

Creation of Subfragments 

To optimize the parent sequences of SEQ ID NOS: 1-20, or other 
parent UTR-derived nucleic acid sequences, standard restriction mapping 
5 techniques are used to create smaller sequences that maintain an RBP binding 
activity and/or mRNA functionality equivalent to the parent sequences. 
Alternatively, PCR amplification using appropriate probes or organic synthesis 
may be used. 

In the restriciton enzyme approach, the parent nucleic acid sequence 
10 is linearized, if necessary, and divided to create subfragments. Different 

combinations of restriction enzymes are used to digest the parent sequences so 
that a variety of overlapping sequences are created for each parent sequence. In 
vitro transcription is carried out to produce mRNA transcripts encoding the 
subfragments as well as the parent sequences. These transcripts are used for 
15 testing RBP binding activity and/or mRNA functionality. Subfragments, or 
combinations of subfragments, which are identified as having RBP binding 
activity and/or RNA functionality equivalent to the parent sequences, are 
optimized, and can be further digested such that smaller and smaller optimized 
subfragments are identified. 

20 

RNA/RBP Binding Pair Interactions 

The RBP binding activity of the nucleic acid subfragments can be 
assessed and compared to their respective parent sequences by measuring the 
RNA/RBP binding pair interaction for each of the nucleic acid sequences. 
25 Subfragments with RBP binding activity equivalent to the parent sequence are 
considered to be optimized by this process. The methods of detecting such 
RNA/RBP binding pair interactions are well known in the art, and include, for 
example, filter binding assays (Wu and Uhlenbeck, Biochemistry 26: 8221- 
8227, 1987; Carey and Uhlenbeck, Biochemistry 22: 2610-2615, 1983), 
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electrophoretic gel mobility shift assays (Izquierdo and Cuezva, Mol. Cell. 
Biol. 17:5255-5268, 1997; Malter, Science 246: 664-666, 1989; Zaidi and 
Maker, J. Biol. Chem. 269: 24007-24013, 1994; Claffey et al., Mol. Biol. Cell 
9: 469-481, 1998; Brewer, Mol. Cell. Biol. 11: 2460-2466, 1991); 

5 homopolymer beads (Siomi et al., Cell 77: 33-39, 1994), or fluorescence 
anisotrophy (Tetin et al., Biochemistry 32: 9011-9017, 1993; Goss et al., 
Nucleic Acids Research 11: 5589-5602, 1983; and Liang et al., WO 98/39484). 

It is preferred that conditions allowing detection of interactions 
between nearly every type of RNA and RBP pair be employed. Exemplary 

10 protocols, binding conditions, and RNA binding proteins that may be used are 
disclosed in detail in PCT application WO 98/04923 and are summarized 
below. 

The preferred conditions allow detection of a majority of RNA/RBP 
interactions. The interactions are facilitated in a binding solution that includes 

15 a buffer, a monovalent cation, a divalent cation, a reducing agent, and a density 
agent. The basic method includes forming a binding solution containing the 
RNA molecules and binding buffer, heating the solution to denature the RNA, 
cooling the solution to the reaction temperature to fold the RNA in proper 
formation, adding RBPs, and detecting the interactions using any suitable 

20 procedure. The specificity of binding pair interactions is assessed by 

comparing the binding in the presence of specific and nonspecific competing 
RNA. If desired, a competitor of nonspecific RNA/RBP interactions, such as 
poly r(G), tRNA, heparin, or unrelated RNA molecules of similar length can be 
added to the binding solution to reduce the background of nonspecific binding. 

25 It is preferred that detection involve the separation of interacting RNA 

molecules and RBPs, such as on the basis of size or physical properties. Two 
preferred methods are filter binding and gel mobility shift. 
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Detection of interactions between RNA binding proteins and RNA 
molecules can be facilitated by attaching a detectable label to the RNA 
molecule. Generally, labels known to be useful for nucleic acids can be used to 
label RNA molecules, including, for example, isotopes such as 33 P^ 32 P, and 35 S, 
5 fluorescent labels such as fluorescein (FITC), 5, 6-carboxy methyl fluorescein, 
Texas red, nitrobenz-2-oxa-l,3-diazol-4-yl (NBD), coumarin, dansyl chloride, 
rhodamine, 4'-6-diaminidino-2-phenylinodole (DAPI), the cyanine dyes (Cy3, 
Cy3.5, Cy5, Cy5.5, and Cy7), and biotin. 

Labeled nucleotides are the preferred form of label since they can be 

10 directly incorporated into the RNA molecules during synthesis. Examples of 
labeled nucleotides include BrdUrd (Hoy and Schimke, Mutation Research 
290: 217-230 (1993)), BuUTP (Wansick et al., J. Cell Biology 122:283-293 
(1993)) and nucleotides modified with biotin (Langer et al., Proc. Natl. Acad. 
Sci. USA 78:6633 (1981)) or with suitable haptens such as digoxygenin 

15 (Kerhof, Anal. Biochem. 205:359-364 (1992)). Suitable fluorescence-labeled 
nucleotides are Fluorescein-isothiocyanate-dUTP, Cyanine-3-dUTP and 
Cyanine-5-dUTP (Yu et al., Nucleic Acids Res. 22:3226-3232 (1994)). A 
preferred nucleotide analog label for RNA molecules is Biotin- 14-cytidine-5' - 
triphosphate. Fluorescein, CY3, and Cy5 can be linked to dUTP for direct 

20 labeling. Cy3.5 and Cy7 are available as avidin or anti-digoxygenin conjugates 
for secondary detection of biotin- or digoxygenin-labeled probes. 

The RBPs used for optimization can be part of a crude or purified 
cellular or nuclear extract, and can be used either in isolation or in combination. 
These RBPs can be prepared using known methods of protein extraction and 

25 purification (Ashley et al., Science 262: 563-566, 1993; Rouault et al., Proc. 
Nat. Acad. Sci. USA 86: 5768-5772, 1989; Neupert et al., Nucleic Acids 
Research 18: 51-55, 1990; Zhang et aL, Mol. Cell. Biol. 13: 7652-7665, 1993; 
and Burd and Drey fuss, Science 265: 615-21, 1994). Alternatively, known 
RBPs can be produced recombinantly using standard techniques. DNA 
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encoding RNA binding proteins can be obtained from available clones, by 
synthesizing a DNA molecule encoding an RNA binding protein with a known 
amino acid sequence, or by cloning the gene encoding the RNA binding 
protein. Techniques for recombinant expression and methods for cloning genes 

5 encoding known proteins are well known (Sambrook et aL, Molecular Cloning, 
Cold Spring Harbor Laboratory, 1989). 

Detection of interactions between RNA binding proteins and RNA 
molecules can also be facilitated by attaching a detectable label to the RBP. 
Preferred labels include 125 I, 3 H, 35 S, and, in the case of recombinant proteins, 

10 they can be incorporated through the use of labeled amino acids. Techniques 
for labeling and detecting proteins are known in the art (Sambrook et al. and 
Ausubel et al., Current Protocols in Molecular Biology, John Wiley & Sons, 
Inc., 1996). Detection of an RBP can also be achieved by the use of an RBP 
specific antibody (Johnstone and Thorpe, Immunochemistry in Practice, 

15 Blackwell Scientific Publications, 1997). 

Functionality 

The mRNA functionality of the subfragment sequences, as compared 
to their corresponding parent sequences, can be used as an independent tool for 

20 identifying optimized subfragments with function equivalent to their parent 
sequences. Alternatively, the mRNA functionality studies can be used in 
combination with RNA/RBP binding pair interaction studies to further verify 
that the optimized sequences are adequate substitutes for the parent sequences. 
Studies of mRNA functionality involve, for example, assessments of the 

25 sequences' effects on translational efficiency, mRNA stability, and transcript 
intracellular transport, as well as the editing and splicing of RNA. 

To determine whether a parent sequence regulates mRNA 
functionality, or whether a subfragment sequence regulates functionality in a 
fashion equivalent to the parent, constructs are generated in which the parent or 



15 



WO 01/34624 




PCT/US00/30888 



subfragment UTR sequence to be tested is adjoined to the coding segment of a 
gene at a position upstream or downstream of the coding region, depending on 
whether the UTR is derived from a 5* or 3' UTR, respectively. The coding 
region may encode a reporter gene, or the gene that is endogenously associated 
5 with the UTR sequence with an inserted tag, such as an epitope tag. 

Intracellular expression of the construct can be achieved by transfecting cells 
with expression vectors containing the construct. If the parent or subfragment 
sequence is adjoined to a reporter gene, then changes in the level of reporter 
gene product can be used as an indication that the parent or subfragment 

10 sequence regulates at least one aspect of mRNA functionality. 

When assessing the functionality of a parent sequence, transcripts 
from constructs which contain the coding region, but lack the UTR sequence, 
are used as controls. To determine whether a subfragment maintains a function 
equivalent to the parent sequence, transcripts from a construct containing the 

15 full-length parent sequence are used as controls. 
mRNA Stability 

The stability of mRNA can be determined using either in vivo or in 
vitro transcribed mRNA. In cells, stability is determined by first blocking 
transcription in transfected cells with a compound such as actinomycin D (5 

20 /ig/ml), and then measuring the degradation rate of the transcripts by 

quantitating their level in cells harvested at different times. To quantify the 
level of transcript, total cell RNA purified from harvested cells is subjected to 
electrophoresis followed by transfer to a filter by pressure blotting. Following 
incubation, the filter is subject to hybridization by a radiolabeled probe 

25 designed to detect the transcript sequence. Additionally, real time PCR with 
total cell RNA can be used for quantitating mRNA degradation rates. Such 
degradation rates are calculated, for example, by densitometric scanning of the 
autoradiographs (Saulnier-Blache et al., Mol. Pharmacol. 50: 1432-42, 1996; 
Yang et al., J. Biol. Chem. 272: 15466-15473, 1997). A decrease in the rate of 
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degradation indicates an increase in mRNA stability. 
Translational Efficiency 

A reticulocyte system can be used to assess the effect of the UTR 
RNA sequences on translational efficiency. In vitro synthesized mRNAs, 

5 derived from corresponding plasmids (up to 40 Mg/ml) are utilized as templates 
for protein synthesis in a nuclease-treated rabbit reticulocyte lysate 
(Amersham). The reactions are performed in the presence of 40 ,uCi of L- 
[ 35 S]methionine (greater than 1 Ci/mM) and 40 U of RNasin (Izquierdo et al., 
Mol. Cell Biol. 17: 5255-5268, 1997; Luis et al., J. Biol. Chem. 268: 1868- 

10 1875, 1993; Santaren et al., J. Biochem. 113: 129-131, 1993). At various times 
for up to 1 hour, the products are separated by SDS-PAGE. Ribosomal 
distribution is indicated by the length (migration) of the transcripts; an increase 
in length indicates an increase in translational efficiency. Efficiency is also 
assessed by monitoring the amount of protein produced; an increase in amount 

15 indicates an increase in translational efficiency. In addition, translational 
efficiency can be monitored by analyzing the distribution of the transcripts on 
polysomes. Transcripts associated with high molecular weight polysomes 
represent actively transcribed messages, those on low molecular weight 
polysomes are associated with poor translation, and those found in the 

20 nonpolysome fraction of a gradient are not being translated. By analyzing the 
percentages associated with each fraction, an estimate of translational 
efficiency can be determined. 

Transcript Distribution 

The poor processing of transcripts may reflect failure of the 
25 transcript to move from the nucleus to associate with translationally active 
ribosomes in the cytoplasm. To assess the effect of the parent UTR sequences 
and the subfragments on this function, the cytosolic versus total cellular 
transcript concentration is compared in cells transfected . Harvested cells are 
lysed and subjected to sucrose gradient fractionation. RNA is precipitated 
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from cell fractions, denatured, and blotted onto a nylon membrane in a slot-blot 
apparatus. Following hybridization to a labeled probe, the transcript RNA 
levels are quantitated for the various fractions. Alternatively, constructs with 
or without the UTR sequence can be in vitro labeled with a fluorescent tag and 
5 transfected into the cell. Cellular distribution of the transcript is then analyzed 
using a fluorescent microscope. If the relative quantity of transcript in the 
cytoplasm compared to total cell transcript RNA is modified when a UTR 
sequence is present, then this UTR sequence affects cytoplasmic transport of its 
associated transcript. 

10 

Screening Assays 

RNA molecules comprised of the disclosed parent nucleic acid 
sequences, or their optimized subfragments, can be used in screening 
procedures to identify compounds that affect the RNA/RBP binding pair 

15 interactions associated with these RNA molecules. In a similar way, novel 
RNA/RBP binding pair interactions associated with these RNA molecules can 
be identified. As an alternative to, or in conjunction with screening for 
RNA/RBP binding pair interactions, screening procedures can be tailored to 
identify compounds that alter the mRNA functionality of the RNA molecules. 

20 The screening protocols can be designed to allow simultaneous assessment of 
the effect of numerous test compounds in a high throughput screening assay, as 
described in further detail in PCT application WO 98/04923. 

Use of the longer parent sequences provides the benefit of more 
physiologically relevant target sequences. On the other hand, use of the 

25 optimized sequences provides the advantage of a smaller alternative molecules 
that can be used when they are better suited to the assay methodology. For 
example, when a large RNA molecule is used in an RBP binding assay, actual 
RBP binding may be difficult to detect because the RNA/RBP binding pair is 
only slightly larger than the RNA molecule alone. In addition, nonspecific 
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interactions with other parts of the parent sequence transcript can interfere with 
detection of specific interactions. This problem can be alleviated by using a 
smaller, optimized subfragment, which has reduced nonspecific binding and an 
equivalent RBP binding activity that is easier to detect because the interacting 

5 RNA/RBP complex is significantly larger than the RNA alone. Furthermore, 
the optimized sequences provide more specific information regarding the actual 
nucleic acid site of RNA/RBP binding pair interaction, which facilitates the 
rational drug design of compounds that affect this interaction. 

The screening protocols for identifying compounds that affect 

10 RNA/RBP binding pair interactions include, for example, filter binding assays 
(Wu and Uhlenbeck, Biochemistry 26: 8221-8227, 1987; Carey and Uhlenbeck, 
Biochemistry 22: 2610-2615, 1983), electrophoretic gel mobility shift assays 
(Izquierdo and Cuezva, Mol. Cell. Biol. 17:5255-5268, 1997; Malter, Science 
246: 664-666, 1989; Zaidi and Malter, J. Biol. Chem. 269: 24007-24013, 1994; 

15 Claffey et al., Mol. Biol. Cell 9: 469-481, 1998; Brewer, Mol. Cell. Biol. 11: 
2460-2466, 1991); homopolymer beads (Siomi et al., Cell 77: 33-39, 1994), or 
fluorescence anisotrophy (Tetin et al., Biochemistry 32: 901 1-9017, 1993; Goss 
et al., Nucleic Acids Research 11: 5589-5602, 1983; and Liang et al., WO 
98/39484) (see generally, WO 98/04923). 

20 With regard to mRNA functionality, compounds are screened for the 

ability to alter the RNA molecule's pre-mRNA processing, or stabilization, 
translational efficiency, localization, sequestration, or editing and splicing 
functions of the mRNA. These functions can be assessed, for example, by 
measuring the expression of the UTR-associated coding region, the half-life of 

25 the transcript (Saulnier-Blache et al., Mol. Pharmacol. 50: 1432-1442, 1996; 
Yang et al., J. Biol. Chem. 272: 15466-15473, 1997), polysomal distribution 
along the transcript (Izquierdo et al., Mol. Cell Biol. 17: 5255-5268, 1997; Luis 
et al., J. Biol. Chem. 268: 1868-1875, 1993; Santaren et al., J. Biochem. 113: 
129-131, 1993), the type of polysome associated with the transcript, or the 
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transcript's intracellular distribution (Yang et al„ 1997, supra). 

In general, extracts, compounds, or chemical libraries that can be 
used in screening assays are known in the art. Examples of such extracts or 
compounds include, but are not limited to, extracts based on plant, fungal, 
5 prokaryotic, or animal sources, fermentation broths, and synthetic compounds, 
as well as modification of existing compounds. Numerous methods are also 
available for generating random or directed synthesis (e.g., semi-synthesis or 
total synthesis) of any number of chemical compounds, including, but not 
limited to, saccharide-, lipid-, peptide-, and nucleic acid-based compounds. 

io Libraries of genomic DNA or cDNA may be generated by standard techniques 
(see, e.g., Ausubel et aL, supra) and are also commercially available (Clontech 
Laboratories Inc., Palo Alto, CA). Nucleic acid libraries used to screen for 
compounds that alter RNA/RBP binding pair interactions or mRNA 
functionality are not limited to the species from which the RNA or RBP is 

15 derived. For example, a Xenopus cDNA may be found to encode a protein that 
alters a human RNA/RBP interaction. 

Synthetic compound libraries are commercially available from 
Brandon Associates (Merrimack, NH) and Aldrich Chemical (Milwaukee, WI). 
Alternatively, libraries of natural compounds in the form of bacterial, fungal, 

20 plant, and animal extracts are commercially available from a number of 

sources, including Biotics (Sussex, UK), Xenova (Slough, UK), Harbor Branch 
Oceanographies Institute (Ft. Pierce, FL), and PharmaMar, U.S.A. (Cambridge, 
MA). In addition, natural and synthetically produced libraries are produced, if 
desired, according to methods known in the art, e.g., by standard extraction and 

25 fractionation methods. 

When a crude extract is found to modulate an RNA/RBP binding 
pair interaction or mRNA functionality, further fractionation of the positive 
lead extract is necessary to isolate chemical constituents responsible. Thus, the 
goal of the extraction, fractionation, and purification process is the 
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characterization and identification of a chemical entity within the crude extract 
having the interaction- or function-modulating activities. The same assays 
described herein for the detection of interactions in mixtures of compounds can 
be used to purify the active component and to test derivatives thereof. Methods 

5 of fractionation and purification of such heterogenous extracts are known in the 
art. If desired, compounds shown to be useful agents for treatment are 
chemically modified according to methods known in the art. 

Compounds which modulate an RNA molecule's RNA/RBP binding 
pair interaction or mRNA functionality may be administered by any appropriate 

10 route for treatment or prevention of a disease or condition associated with the 
expression of the protein endogenously associated with the gene from which 
SEQ ID NOS: 1-20 are derived. Examples of such diseases and conditions 
include neurodegenerative disease, stroke, cardiovascular disease, peripheral 
vascular disease, high blood pressure, cancer, including breast cancer, 

15 inflammatory diseases, such as rheumatoid arthritis, Crohn's disease, diseases 
associated with cellular proliferation, metabolic disorders, such as obesity and 
diabetes, and infectious diseases, such as bacterial or viral infections. 
Administration may be parenteral, intravenous, intra-arterial, subcutaneous, 
intramuscular, intracranial, intraorbital, ophthalmic, intraventricular, 

20 intracapsular, intraspinal, intraci sternal, intraperitoneal, intranasal, aerosol, by 
suppositories, or oral administration. 

Therapeutic formulations may be in the form of liquid solutions or 
suspensions; for oral administration, formulations may be in the form of tablets 
or capsules; and for intranasal formulations, in the form of powders, nasal 

25 drops, or aerosols. 

Methods well known in the art for making formulations are found, 
for example, in "Remington's Pharmaceutical Sciences." Formulations for 
parenteral administration may, for example, contain excipients, sterile water, or 
saline, polyalkylene glycols such as polyethylene glycol, oils of vegetable 
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origin, or hydrogenated napthalenes. Biocompatible, biodegradable lactide 
polymer, lactide/glycolide copolymer, or polyoxyethylene-polyoxypropylene 
copolymers may be used to control the release of the compounds. Other 
potentially useful parenteral delivery systems include ethylene- vinyl acetate 
5 copolymer particles, osmotic pumps, implantable infusion systems, and 
liposomes. Formulations for inhalation may contain excipients, for example, 
lactose, or may be aqueous solutions containing, for example, polyoxyethylene- 
9-lauryl ether, glycholate and deoxycholate, or may be oily solutions for 
administration in the form of nasal drops, or as a gel. The concentration of the 
10 compound in the formulation will vary depending upon a number of factors, 
including the dosage of the drug to be administered, and the route of 
administration. 

The formulations can be administered to human patients in 
therapeutically effective amounts (e.g., amounts which prevent, eliminate, or 

15 reduce a pathological condition) to provide therapy for a disease or condition. 
Typical dose ranges are from about 0. 1 jug/kg to about 1 g/kg of body weight 
per day. The preferred dosage of drug to be administered is likely to depend on 
such variables as the type and extent of the disorder, the overall health status of 
the particular patient, the formulation of the compound excipients, and its route 

20 of administration. 

An additional use for the parent UTR sequences or their optimized 
subfragments is their incorporation into a recombinant construct such that 
expression of the construct is controlled by the UTR sequence. For example, a 
nucleic acid sequence of the invention can be inserted into a heterologous gene 

25 to form all or a part of the untranslated region of the gene's mRNA transcript. 
It is expected that the UTR sequence will interact with an RBP in these 
recombinant RN A molecules and that protein expression of the heterologous 
gene will be affected. This is analogous to recombining promoters with 
heterologous coding regions to alter or control the expression of the coding 
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region. 

Other Embodiments 

All publications and patent applications mentioned in this 
specification are herein incorporated by reference. 

While the invention has been described in connection with specific 
embodiments, it will be understood that it is capable of further modifications. 
Therefore, this application is intended to cover any variations, uses, or 
adaptations of the invention that follow, in general, the principles of the 
invention, including departures from the present disclosure that come within 
known or customary practice within the art. Other embodiments are within the 
claims. 

What is claimed is: 
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Claims 

1 . A nucleic acid sequence comprising any one of the nucleic acid sequences 
of SEQ ID NOS: 1-20, or a subfragment nucleic acid sequence derived from 

5 any one of the sequences of SEQ ID NOS: 1-20, wherein an mRNA molecule 
comprising said sequence has RNA binding protein (RBP) binding activity or 
regulates the functionality of said mRNA. 

2. The nucleic acid sequence of claim 1, wherein said subfragment nucleic acid 
10 sequence is optimized. 

3. The nucleic acid sequence of claim 1, wherein the regulation of mRNA 
functionality comprises an alteration in pre-mRNA processing or in the 
stabilization, translational efficiency, localization, sequestration, editing, or 

15 splicing functions of said mRNA. 

4. A method of identifying an optimized subfragment of any one of the parent 
nucleic acid sequences of SEQ ID NOS: 1-20, said method comprising 
isolating a subfragment nucleic acid sequence from said parent nucleic acid 

20 sequence, assaying RNA molecules comprising said subfragment for RBP 

binding activity or mRNA functionality, and identifying a subfragment nucleic 
sequence that maintains an RBP binding activity and/or mRNA functionality 
that is equivalent to said parent sequence. 

25 5. The method of claim 4, wherein said subfragment nucleic acid sequence is. 
isolated by restriction enzyme digestion. 

6. The method of claim 4, wherein said subfragment is identified by deletion 
mapping. 
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7. The method of claim 4, wherein said mRNA functionality comprises an 
alteration in pre-mRNA processing or in the stabilization, translational 
efficiency, localization, sequestration, editing, or splicing functions of said 



8. A nucleic acid sequence identified as an optimized subfragment of any one 
of SEQ ID NOS: 1-20 by the method of claim 4. 

9. A method of identifying a candidate compound having an effect on an 
10 RNA/RBP binding pair interaction or mRNA functionality, said method 

comprising contacting an RNA molecule comprising at least one nucleic acid 
sequence of any one of SEQ ID NOS: 1-20, or at least one optimized 
subfragment sequence derived from any one of SEQ ID NOS: 1-20, with at 
least one RBP, and at least one test compound, and measuring said RNA/RBP 
15 binding pair interaction and/or mRNA functionality, wherein a candidate 
compound is identified as a test compound that affects said interaction and/or 
functionality. 

10. The method of claim 9, wherein said mRNA functionality comprises an 
20 alteration in pre-mRNA processing or in the stabilization, translational 

efficiency, localization, sequestration, editing, or splicing functions of said 
mRNA. 

1 1. A method for identifying an RBP that interacts with an RNA molecule 

25 comprising the nucleic acid sequence of any one of SEQ ID NOS: 1-20, or an 
optimized subfragment sequence of any one of SEQ ID NOS: 1-20, said 
method comprising contacting said RNA molecule with at least one RBP, and 
measuring RNA/RBP binding pair interactions, wherein detection of said 
interactions identifies said RBP that interacts with said RNA molecule. 



mRNA. 
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SEQUENCE LISTING 
<110> Message Pharmaceuticals, Inc. 

<120> NUCLEIC ACID SEQUENCES AND METHODS FOR 

IDENTIFYING COMPOUNDS THAT AFFECT RNA/RNA BINDING PROTEIN 
INTERACTIONS AND MRNA FUNCTIONALITY 



<130> 50093/014W01 

<150> 09/437,458 
<151> 1999-11-10 

<160> 20 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 278 
<212> DNA 

<213> Homo sapiens 



ctaaQtaact agggaagata atattaagga agacaatgtg aaaagaaaaa tgagcctggc 60 

aaSItgcgt tSLc?tgg tttttaaaaa actgctgact gttttctctt gagagggtgg 

r t c a!t attcgctgtg tcagcataga agtaacttac "aggtgtgg ^ ■ «0 

ataactttgt ttagcccaaa accaagtcaa gtgaaaaagg aggaagagaa aaaatatttt ^ 

cctgccaggc atggaggccc acgcacttcg ggaggtcg 

<210> 2 
<211> 243 
<212> DNA 

<213> Homo sapiens 



oaaqtaacta atgtttgaaa ttttaaagta cttttgggta tttttctgtc atcaaacaaa 60 

acaggJatca gtgcattatt aaatgaatat ttaaattaga cattaccagt aatttcatgt 120 

ctaStttta aaatcagcaa tgaaacaata atttgaaatt tctaaattca tagggtagaa 180 

tcacctgtaa aagcttgttt gatttcttaa agttattaaa cttgtacata taccaaaaag 240 
aag 



<210> 3 
<211> 122 
<212> DNA 

<213> Homo sapiens 



gggcggttgg ctttgttggg tgagcttgtt tgtgtccctg tgggtggacg tggttggtga 60 
ttggcaggat cctggtatcc gctaacagaa ctaggtcaaa atgcagatct tcgtgaaaac 
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122 



CC 

<210> 4 
<211> 127 
<212> DNA 
<213> Homo sapiens 

<400> 4 

ctcccagccc gtctccccgc tccagtttag aacctaattc ccaattcccg gaccgggccc 
agccctgggc tcttactgtc cgcttttgct gggacctgtt ccacaaatgg gcgtcttctg ^ 
ccttggg 

<210> 5 

<211> 190 

<212> DNA 

<213> Homo sapiens 



agcgcccggg caggccaccc cgagcccctt aactgcgcag gcgctctcac tcagaaaggc 
cgctgggtgc gggagcgcag aggcggtgca gggcggctgg ctcgcctcgg cgtgcagtgc 
gcgtgcgtgg agctgggagc taggtcctcg gagtgggcca gagatggcgg cggccgacgg 
ggctttgccg 

<210> 6 

<211> 69 

<212> DNA 

<213> Homo sapiens 

<400> 6 

ttctttgctg agggtcacat tgagctgcag gttgaatccg gggtgccttt aggattcagc 
accatggcg 

<210> 7 

<211> 84 

<212> DNA 

<213> Homo sapiens 

Ilgtccctcc tgtctctgca gcttgttccc ggaagttttg ctgctagtcg cggacgcaat 
ggcttcaagg ttacttcgcg gage 

<210> 8 

<211> 230 

<212> DNA 

<213> Homo sapiens 

actgegaage ggcttcttca gagcaeggge tggaactggc aggcaccgcg agcccctagc 
acccgacaag ctgagtgtgc aggacgagtc cccaccacac ccacaccaca geegctgaat 
gaggcttcca ggcgtccgct cgcggcccgc agagccccgc cgtgggtccg cccgctgagg 
cgcccccagc cagtgegett acctgccaga ctgcgcgcca tggggcaacc 



60 
120 



60 
120 
180 
190 



60 
69 



60 
84 



60 
120 
180 
230 
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<210> 9 

<211> 234 

<212> DNA 

<213> Homo sapiens 

<400> 9 

gtggcgcgag cttctgaaac taggcggcag aggcggagcc gctgtggcac tgctgcgcct 
ctgctgcgcc tcgggtgtct tttgcggcgg tgggtcgccg ccgggagaag cgtgagggga 
cagatttgtg accggcgcgg tttttgtcag cttactccgg ccaaaaaaga actgcacctc 
tggagcggac ttatttacca agcattggag gaatatcgta ggtaaaaatg ccta 

<210> 10 

<211> 260 

<212> DNA 

<213> Homo sapiens 

<400> 10 

ggaaaggcta aagacgatca tgagagagaa atattcaaag tgttcgagct gaatatttta 
atttatgagt ttttgatagc tttatttttt aagtatttat. atatttataa ctcatcataa 
aataaagtat atatagaatc taacagcaat ggcatttaat gtattggcta tgtttacttg 
acaaatgaaa ttatggtttg caacttttag ggaaatcaat ttagtttacc aagagactat 
aaatgctatg gagccaaaac 

<210> 11 

<211> 159 

<212> DNA 

<213> Homo sapiens 

<400> 11 

gtccaggaac tcctcagcag cgcctccttc agctccacag c.cagacgccc tcagacagca 
aagcctaccc ccgcgccgcg ccctgcccgc cgctgcgatg ctcgcccgcg ccctgctgct 
gtgcgcggtc ctggcgctca gccatacagc aaatccttg 

<210> 12 

<211> 124 

<212> DNA 

<213> Homo sapiens 

<400> 12 

actccgcggt atctgcatcg ggcctcactg gcttcaggag ctgaataccc tcccaggcac 
acacaggtgg gacacaaata agggttttgg aaccactatt ttctcatcac gacagcaact 
taaa 

<210> 13 

<211> 262 

<212> DNA 

<213> Homo sapiens 



<400> 13 

taaaatagca acactctata tttagattgt taaaataact agtgttgctt ggactattat 
aatttaatgc atgttaggaa aatttcacat taatatttgc tgacagctga cctttgtcat 



caaaqapgag gggactgggg tcccggattc ttgggtctcc aagaagtctg tccacagact 
llllclltll ctcttcccca tctaggcctg ggcaggaaca tatattattt att aagcaa 
SJttttca tg tt g gggt g 999-ggagg ggaaagggaa jcctgg^tt 9 - 
atgtgagaaa cctttgtgag acagagaaca gggaattaaa tgtgtcatac atatccactt 



gagggcgatt tgt 

<210> 15 

<211> 285 

<212> DNA 

<213> Homo sapiens 



^aqccctcac cccccgcctc ccctacCgtt ctcatggtgt tggcctcgtg "tgctatgc 
gagccctcac G ™r * cc taatcccc ttatcttcat ggaaatggac tgactttatg 

ccJatgaagt ccccaggagc aS ata ctgagaaaac caggctcttt ggggctagac 
agaftggSg agagSgat ctccctctct gagaggagca gcagatgctc acagaccaca 
clcagctcag gccccttgga gcaggatggc tcctctaaga atctc 



180 
240 
262 
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ctttcttcta ttttattccc tttcacaaaa ttttattcct atatagttta ttgacaataa 
tttcaggttt tgtaaagatg ccgggtttta tatttttata gacaaataat aagcaaaggg 
agcactgggt tgactttcag gt 

<210> 14 

<211> 253 

<212> DNA 

<213> Homo sapiens 

<400> 14 _____ „„, = «frfn irracaaact 60 

120 



180 
240 
253 



60 
120 
180 
240 
285 



<210> 16 

<211> 210 

<212> DNA 

<213> Homo sapiens 

£s 5S2S S3=S =s=s = s=5 £ 

gtg««"t £«Lc.« gt«_tct« .ttttg«.t ...c.c.t.t g t g g»aa*«> ISO 
caacatgagc tggtctcttg gcaattattc 

<210> 17 

<211> 239 

<212> DNA 

<213> Homo sapiens 

^tttcat ttctactgtg actgatgtta catcacagtg tttgcaatgg tgttgccctg 60 
X caaggaccag gttattttaa aaagatttgt tttgteuflt gtctttgU 120 

ggcgfctgca cccaggggtg gggaatgt.t gggcagaagg f gaagga t c "gaa.gtgt 
tttctgaata acatttgtgt ggtgggttct ttggaaggag tgagatcatt ttcttatct 

<210> 18 
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<211> 340 
<212> DNA 

<213> Rattus norvegicus 



cctatnagtg aagattggta aaattgtcag tttaacccgg ctgtcctact actaatattt 
aatttttcaa atatgaaaag gtttcagatt ttgtttagat ttatatcaca ttaaacactg 
tcaaataaag gctgttttta tatgcatcgt tgatgttcca aaatgtgaag tctaaatggt 
otctgtattt ccaattatta aataacttct aagatcattt ttaaaagtct gtagatggta 
tggatagcta gttgtttgtt aatataaagt aaaagtagat agctgattta tgttgtacct 
atgtcgtatg tatattaggt atcgtgttgt ctcactaaag 



<210> 19 
<211> 240 
<212> DNA 

<213> Rattus norvegicus 

gccgacgtgt cgaaaggatg ctcaggatat gaatgagcca gcggacgcgt gaatggaggt 6.0 
g^ggaagc gtgaagtaca aggcggggct ccagctgggt gacctcgagg "^caggcc . ™ 
SgSccct cccctctcct cccagcctcc cactccgctc ccctccccgc 99cgcgcaga 180 
ccttccgagc cctcccgagc caccgcgggc aacagagacg ggaccacgca gccaccgagc 240 



<210> 20 

<211> 259 

<212> DNA 

<213> Mus musculus 



gccctctgtc agcagctctc cagcatggtc ctctgacact cctcagatga actgttctca 60 
^cggaagctt gctgcctttt tacaagatga gcttttactc tcttccagga agtagc : ttt 120 



tttctagctg agaattaata atggtctttc tctttggaag tcatatcaaa gUtuttff 
tgggggcctt g^tttgtttt ggtttttgga gacagggtct cactgtgtag tcctagctgg 
cctggaactc actatgtag 



180. 
240 
259 
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